Making real-time content selection decisions based on multivariate testing

ABSTRACT

Server side web based ad stitching and user specific optimization are disclosed. In various embodiments, a request for a page is received via a communication interface. A processor is used to provide an optimized version of the page at least in part by determining dynamically an optimized content element for the page based at least in part on a user attribute associated with the request and content performance data generated based at least in part on testing of alternative versions of the page with respect to other users having the user attribute.

CROSS REFERENCE TO OTHER APPLICATIONS

This application is a continuation in part of co-pending U.S. patent application Ser. No. 16/219,018 entitled SERVER SIDE WEB BASED AD STITCHING AND USER SPECIFIC OPTIMIZATION filed Dec. 13, 2018, which is incorporated herein by reference for all purposes, which claims priority to U.S. Provisional Application No. 62/598,371, entitled SERVER SIDE WEB BASED AD STITCHING AND USER SPECIFIC OPTIMIZATION filed Dec. 13, 2017 which is incorporated herein by reference for all purposes.

BACKGROUND OF THE INVENTION

The web runs slowly because conventionally pages are loaded via output pages from content management systems (CMS) with ad tags which are generally scripts that are loaded on the client's browser that then call other ad servers or ad tech that then inject ads, handle verification and other ad functions. This is all done on the client side so the web page loads slowly. Since ads are dynamically injected from third parties, ad blockers can block these ads by blocking calls to these third party servers.

Publishers also have a difficult time moving away from their CMS systems since they have built their operations and have their content loaded into systems that house the content and manage the layout and rendering of their pages.

BRIEF DESCRIPTION OF THE DRAWINGS

Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.

FIG. 1A is a block diagram illustrating an embodiment of a prior art system to serve web pages and ads.

FIG. 1B is a block diagram illustrating an embodiment of a prior art system to distribute web page content.

FIG. 2 is a block diagram illustrating an embodiment of a system to provide user-specific optimized content.

FIG. 3 is a block diagram illustrating an embodiment of a system to provide user-specific optimized content.

FIG. 4 is a block diagram illustrating an embodiment of a content optimization server/service.

FIG. 5A is a block diagram illustrating an embodiment of a system to create and maintain a user knowledge base.

FIG. 5B is a block diagram illustrating an embodiment of a system to create and maintain a content analytics knowledge base.

FIG. 6 is a flow chart illustrating an embodiment of a process to provide user-specific optimized content.

FIG. 7 is a flow chart illustrating an embodiment of a process to generate dynamically a web page comprising user-specific optimized content.

FIG. 8 is a flow chart illustrating an embodiment of a process to determine user-specific content to be included in a dynamically generated optimized web page.

FIG. 9 is a flow chart illustrating an embodiment of a process to perform A/B testing of different versions of a page.

FIG. 10 is a flow chart illustrating an embodiment of a process to evaluate and use the results of A/B testing.

FIG. 11 is a flow chart illustrating an embodiment of a process to perform testing relative to a control group that receives an un-optimized version of a content page.

FIG. 12 is a flow chart illustrating an embodiment of a process to evaluate and use the results of testing relative to a control group.

FIG. 13 is a flow chart illustrating an embodiment of a process to determine dynamically whether to serve an optimized version of a page.

DETAILED DESCRIPTION

The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.

A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.

Techniques are disclosed to compose in real time and serve a quick-loading, personalized version of a web page. In various embodiments, a publisher's web page is retrieved and parsed to determine its structure and to identify and extract content elements. A request for the web page associated with a user is responded to at least in part by composing in real time a version of the page personalized for the user. In some embodiments, publisher content elements may be selected and/or reordered based at least in part on information about the user, e.g., demographic data, user-specific content consumption metrics, page-related consumption metrics, other analytics, etc. In some embodiments, ads or other third party content elements (native ads, targeted ads, product offerings, sponsored ads or articles, banner ads, sponsored video, and/or other sponsored content) are selected based at least in part on the user. Ad content is stitched into the personalized page, prior to serving the page, and ad-related scripts are stripped out of and/or omitted from the page resulting in a much faster-loading page as experienced at the user/client/browser end. In some embodiments, limited scripts may be included in the page, such as scripts to verify ad content was displayed and/or viewed at the user/client/browser, scripts to monitor content consumption, etc. In some embodiments, the system optimizes the type of script being added to the page based on what type of ads are being returned, reducing the code running on the page.

In various embodiments, AB testing is performed. For example, a first optimized version, the “A” version, may be sent to one subset of a set of similar users, while a second subset of the set of similar users may receive an alternate or “B” version of the same page. In some embodiments, the “B” version may be a differently-optimized set of content. For example, the “B” version may include different content items, such as different publisher content items, different ads, and/or other different content than the “A” version. The “B” version may include the same or some of the same content items as the “A” version, but in different locations on the page.

In some embodiments, the “B” version may be a version of the page that has not been optimized and/or has not been as fully optimized. For example, in some embodiments, the “B” version may include one or more scripts required to be executed at the client side, while the “A” version of the page may be served without those scripts, which instead may have been executed at the server side to provide an “A” version of the page that does not require the client to execute the scripts.

FIG. 1A is a block diagram illustrating an embodiment of a prior art system to serve web pages and ads. In the example shown, a client 102, e.g., a computer system, mobile device, etc. using browser software and/or a mobile app with an embedded browser, is used to request (arrow “1”) a publisher page from a publisher's content management system 106. For example a user may have selected a link or entered in a navigation or search field a URL associated with the publisher page. The publisher content management system 106 returns an HTML page (arrow “2”) that includes the content and/or further links/scripts to retrieve additional content. The client 102 begins to render the page and executes scripts included in the page, e.g., JavaScript™, resulting in one or more ad calls (arrow “3”) being sent to the page publisher's ad server 108 (e.g., DoubleClick™ for Publishers, Google™ Ad Manager, or another hosted ad server), which returns responsive ad information, such as an ad tag indicated an ad selected to be displayed to the user. In a typical scenario, there may be multiple calls to the publisher ad server 108, e.g., loading JavaScript™, making 1-2 calls until the third party ad tag is returned, etc. The response from the publisher's ad server 108 directs the client 102 to a third party ad server 110 to obtain ad creative content, e.g., video, images, scripts, etc. (arrows “5”, “6”). The client/browser 102 only then has all the (publisher plus ad) content required to fully render the originally requested page.

In the example shown in FIG. 1A, the client/browser 102 interacts with five different remote entities (106, 108, 110) to obtain and render the originally requested page. Each interaction is made over a relatively slow client-server connection from the client/browser 102 to the remote node, and multiple interactions are done to obtain and render content comprising the originally requested page. In a typical real world scenario, many more interactions than those shown in FIG. 1A may be required, e.g., to track ad impressions, etc.

FIG. 1B is a block diagram illustrating an embodiment of a prior art system to distribute web page content. In the example shown in FIG. 1A, the content distribution network 112 may be implemented as shown in FIG. 1B. In the example shown, content distribution network (CDN) 120 includes a CDN distribution server 124, which is configured to retrieve web page content from publisher servers, such as publisher web server 122. When a request for a page is received, for example, the CDN 120 pulls the latest page version from the publisher CMS/web server and may or may not cache the page for further/future distribution. CDN distribution server 124 distributes via network 132 and stages copies of the content on geographically distributed CDN edge servers such as CDN edge servers 126, 128, and 130. Clients requesting a web page associated with the publisher web server 122 (or other publishers) may be directed to obtain the content from nearby CDN edge servers. In the example shown, for example, clients 134 and 136 may be directed to CDN edge server 126, clients 138 and 140 to CDN edge server 128, and client 140 to CDN edge server 130. In this manner, a requesting client may be directed to obtain a copy of the content from a CDN edge server that is nearer to the requesting client (e.g., in terms of geography, region, distance, network topology, etc.) than other CDN edge servers and/or the publisher web server 122.

While the use of CDN's such as CDN 120 of FIG. 1B enables content to be downloaded more quickly to requesting clients than if a CDN were not used, as FIG. 1A illustrates in the conventional approach a client/browser still must execute a number of scripts and engage in a series of interactions with multiple nodes to obtain and render the publisher and other (e.g., ad) content comprising a web page.

Techniques are disclosed to generate and serve in real time a quick-loading web page comprising user-specific, optimized content. In various embodiments, a CDN or other edge server is configured to provide in response to a request, e.g., from a client/browser associated with a user, a page comprising content optimized for the user. In some embodiments, one or more scripts associated with obtaining ads to be rendered on the page are omitted from the page as sent to the client/browser. Instead, ad content to be displayed to the user in the context of the page is obtained on the server side and included in the page prior to the page being served to the client/browser. Conventional techniques to identify ad content for the user may be performed in various embodiments on the server side, prior to serving the page, to enable ad content targeted to the requesting user to be identified, obtained, and integrated into the page prior to the page being served to the client/browser. In some embodiments, cache headers are used to ensure that the page returned to the specific user/browser will be used once and only once, allowing the content (e.g., the ads) to be updated for every new request made by a browser.

In various embodiments, publisher content optimized for the user may be determined, obtained, and included in a page prior to being served to the client/browser, as disclosed herein. In some embodiments, user- and/or publisher content element-specific metrics may be used to determine for a given user with respect to a given page request the publisher content elements to be included in the page as served and/or the order and/or placement of publisher and/or ad content elements comprising the page.

In some embodiments, content publishers are enabled to customize the site-ads and editorial for each user. The position of the editorial can be varied on each subsequent visit, frequency cap the appearance of articles for that user or move the top article to the bottom position if the user already read that article. In some embodiments, user behavior and location data are used to optimize the editorial and ad layout of the sites and automatically optimize it based on subsequent engagement data.

In various embodiments, ads or other content may be integrated into a publisher's page using one or more techniques described in U.S. Pat. No. 9,286,662, issued Mar. 15, 2016, entitled Press Release Distribution System, the entire contents of which are incorporated herein by reference for all purposes.

In various embodiments, techniques disclosed herein may be used to add a new presentation layer to a website or other content page. The page content and/or ads are analyzed and a new layout or design applied. The new layout or design may be decided by a person or automatically designed by artificial intelligence, machine learning, or other automated processes. In some embodiments, a website can be redesigned in an optimized way for various users without having to move to a new web development system. In some embodiments, a design template is applied to an existing site no matter what content management system (CMS) that site is hosted on and the site layout or presentation is changed without moving to content to a new CMS. Use cases include without limitation a commerce site that want to have a completely different look for the holidays, change their brand presentation, try radical new product presentations, etc.

In some embodiments, third party verification (e.g., JavaScript™) calls and/or other function calls are made at the server, enabling a page to omit the code configured to make the calls from a client to which the page is served.

FIG. 2 is a block diagram illustrating an embodiment of a system to provide user-specific optimized content. In the example shown, system 200 includes a client/browser 202 that requests (arrow “1”) a publisher page. The request is resolved (e.g., through DNS routing, such as CNAME) to a CDN edge server 206. CDN edge server 206 is configured to obtain publisher content for the page from the publisher's content server 208, e.g., the publisher's content management system (CMS) (arrows “2” and “3”). In various embodiments, the CDN edge server 206 may be configured to cache the publisher page, e.g., for a prescribed time.

The CDN edge server 206 provides the publisher page (arrow “4”) to a real time content optimization server 210, along with the original page URL (i.e., the request by the user associated with client/browser 202) and information about the requesting user (e.g., from a cookie on the client/browser 202). In some alternative embodiments, the real time content optimization server 210 obtains the publisher page directly from the publisher content server 208. In various embodiments, real time content optimization server 210 parses the publisher page to determine its structure and to extract component publisher content elements (e.g., articles, images, video, etc.) and advertising-related and other scripts. In some embodiments, scripts are parsed to determine meta-information required to identify and obtain ad content, e.g., ad content targeted for the user based on the user information received by the real time content optimization server 210 from the CDN edge server 206. In some embodiments, observed and/or user-configured preferences of the user may be used by the real time content optimization server 210 to identify/select ads for the user.

In various embodiments, real time content optimization server 210 interacts (arrows “5” and “6”) with a third party ad server 212 to identify one or more ads to be included in the page, and obtains ad content (arrows “7” and “8”) from a source of ad content 214. In some alternative embodiments, only ad HTML is injected to the page, and when the browser 202 renders the page (with the ads), the browser 202 fetches the relevant multiple-media files, such as images, from the source of ad content 214. The real time content optimization server 210 in various embodiments injects/includes the obtained ad content in the ad locations (slots) it selects, e.g., based on publisher rules and/or criteria in some embodiments, to be included in the page.

In some embodiments, real time content optimization server 210 optimizes publisher content as included in the page. For example, real time content optimization server 210 may adapt the editorial (publisher) content to better fit the user's preference. This may include, but is not limited to, promoting articles the user is likely to read, demoting articles already read by this specific user etc.

In various embodiments, consumption of publisher content elements, by an individual user and/or across users, is tracked and used to optimize the selection and/or placement of content. In some embodiments, content consumption is tracked as disclosed in U.S. Pat. No. 9,912,768, entitled MEASURING CONTENT CONSUMPTION, issued Mar. 6, 2018, the entire contents of which are incorporated herein by reference for all purposes.

In various embodiments, real time content optimization server 210 composes and returns to the CDN edge server 206 (arrow “11”) a flattened (e.g., ad-related JavaScript and/or other scripts omitted) and optimized (e.g., publisher content elements ads selected and/or placement optimally for the requesting user) page. The CDN edge server 206 serves the page (arrow “12”) to the client/browser 202, which is able to render the page relatively quickly, since at least some of the ad-related scripts are not included and the required publisher and ad content are included in the page as served. In addition, techniques disclosed herein result in a page being served in which the page HTML is not being changed while the browser needs to render it, since the ads have already been injected, so within a single run, the entire page and ads are rendered.

In some embodiments, ads or other content compatible with the layout of a publisher's page(s) may be determined programmatically. For example, the structure/layout of the publisher's page(s) may be learned and a representation of the structure stored. In some embodiments, layout changes may be detected automatically. For example, the publisher's pages may be crawled periodically and/or on an ongoing basis, and new and/or newly-structured pages may be detected and their structure learned and stored. In some embodiments, techniques are used to quickly extend to other publisher pages page structure information gleaned previously from other pages having a same/similar structure, e.g., other pages of that same publisher. In some embodiments, page structure may be learned as disclosed in U.S. Pat. No. 9,880,994, entitled DETECTING COMPATIBLE LAYOUTS FOR CONTENT-BASED NATIVE ADS, issued Jan. 30, 2018, the entire contents of which are incorporated herein by reference for all purposes.

FIG. 3 is a block diagram illustrating an embodiment of a system to provide user-specific optimized content. The system 300 is similar in some respects to the system 200 of FIG. 2, except that in the example shown in FIG. 3 the real time content optimization server and/or its function(s) has/have been integrated into a content optimizing CDN edge server 306.

In the example shown in FIG. 3, a client/browser 302 requests (arrow “1”) a publisher page and request is resolved to a content optimizing CDN (or other) edge server 306. Edge server 306 includes real time content optimization service 308 (e.g., as co- or locally hosted server, a module, etc.). In various embodiments, real time content optimization service 308 determines publisher and ad content for and composes a page optimized for the requesting user, e.g., as described above in connection with real time content optimization server 210. Specifically, in the example shown, real time content optimization service 308 obtains publisher page structure information and/or publisher content elements from publisher content server 310 (arrows “2” and “3”); identifies ads targeted to the user via a call to ad server 312 (arrows “4” and “5”); and obtains ad creative content from ad content server/network 314 (arrows “6” and “7”). Real time content optimization service 308 uses the obtained page structure information, publisher content element information and content, and ad information and content to compose a flattened (e.g., some/all ad scripts omitted, publisher and ad content included statically) page to be served to the requesting client/browser 302 (arrow “8”). In some alternative embodiments, the ad creative content (video, images, etc.) is obtained by the client/browser 302 directly from a source of ad content, e.g., ad content 314.

In some embodiments, a system similar to that shown in FIG. 3 is used, except that the real time optimization service 308 is hosted in a separate server interposed between the CDN edge server 306 and the publisher content server 310, ad server 312, and ad content 314. In some such embodiments, the CDN edge server 306 does not retrieve page content for a publisher page from the publisher content server 310 (e.g., publisher's CMS) and instead obtains the page, fully flattened (e.g., ads stitched in, etc.) as disclosed herein, from the real time optimization service 308.

FIG. 4 is a block diagram illustrating an embodiment of a content optimization server/service. In various embodiments, real time content optimization service 308 of FIG. 3 and/or real time content optimization server 210 of FIG. 2 may be implemented at least in part as shown in FIG. 4. In the example shown, real time content optimization service 308 includes a request handler 402 that processes requests 404 for publisher pages received via an internal or external (e.g., network) communication interface. Request handler 402 passes the page request (e.g., URL, requesting client) and associated user information (e.g., user identifier or other identifier, e.g., from a cookie or other source) to content optimization module 406. In various embodiments, content optimization module 406 comprises a set of one or more processes and associated data structures configured to determine page structure and content (publisher and/or ad content) to compose and serve a page with content optimized for the requesting user, as disclosed herein.

In the example shown, content optimization module 406 uses page metadata stored in page metadata database 412 to determine the page structure of the requested page. In various embodiments, the page structure may have been learned by previously crawling and parsing the page and/or a page having a similar structure, and storing a representation of the page structure in page metadata database 412. In some embodiments, if the page structure is not already known the page is retrieved, e.g., from the publisher's content management system, and the page is parsed and analyzed in real time to determine its structure. For example, processing may be performed to determine the page is similar in structure to a page processed previously and for which a representation of the page structure was stored previously. Or, the page structure may simply be determined in real time. In some embodiment, page structure is determined in the background and the page is returned “as is” for the first few calls, until the structure of the page has been learned.

In this example, content optimization module 406 consults and applies publisher rules 414 to determine optimized content for the page. Publisher rules 414 may be configured a priori by or on behalf of the publisher to indicate the publisher's preferences and/or requirements as to which content elements on which pages may be optimized in which manner. For example, a publisher may prescribe or otherwise define a rule that limits the reordering of content elements so that the topmost content element is never moved, is always moved, and/or may (only) under prescribed conditions be moved to a lower position in the page. In another example, a publisher may establish a rule that no more than a prescribed number or proportion of content elements may be replaced and/or reordered, or that content elements in adjacent/consecutive page locations may not be replaced. Additional examples of publisher rules include, without limitation, rules based on geolocation of the user, time of day, day of the week, or season of the year; rules based on the number of times the same user has been shown the same or similar content; etc.

Content optimization module 406 also uses information about the user with whom the request is associated. User identify information received with the request 404 is used to access information about the user from a user knowledge base 416. In various embodiments, user knowledge base 416 may contain user-specific information about the individual user. For example, information read from cookies placed previously on the user's browser may be used to observe the user's browsing history and/or habits.

In some embodiments, the real time content optimization service 308 receives from the client a request that includes first-party (publisher) cookies associated with the publisher's site domain and ad server cookies. The real time content optimization service 308 identifies the source of each cookie and sends each to its owner. The real time content optimization service 308 merges received cookies and cookies being written as first party (under the site domain) and returns a response. In some embodiments, real time content optimization service 308 adds an additional cookie sync pixel for first time users. The pixel sends a request to a third party ad server (with the third-party cookie). The ad server returns third-party cookie information and redirects back to the real time content optimization service 308 to complete the cookie sync process.

In some embodiments, tags or code on or associated with publisher pages served to the user may generate content consumption metrics specific for that user with high granularity. For example, the amount of time the user spent interacting with each specific content element on a page (e.g., video clip, image, retailer product browsing widget, other ads or sponsored content, individual article summaries, etc.) may be observed and reported and used update the user's record in the user knowledge base 416. In some embodiments, demographic, geolocation, and/or other information about the user may be used to access from user knowledge base 420 information about what content the user may respond to more favorably, e.g., based on observation of content consumption metrics across users in that same demographic, geographic region, etc.

User knowledge base 416 may also include information identifying which content elements (publisher content, ads) the user has already seen, e.g., in previously-accessed publisher pages, and/or how many times and in which locations the content was displayed. In some embodiments, this information may be used to determine placement within the page of a content element. For example, an article the user has seen once before may be placed lower in the page, or may be placed lower or even omitted based on previously observed interaction of the user with that content.

In the example shown, user information is used along with content analytics data 418 to select optimized content for the user. For example, the performance of each individual content element, whether publisher content or ad content, may be tracked over time and content elements determined to generate higher interest from users and/or from users like the requesting user may be more likely to be included in the page as composed for and served to this requesting user.

In various embodiments, the user knowledge base 416 and content analytics data 418 are updated continuously and/or periodically by analytics module/interface 420. In various embodiments, analytics module/interface 420 may receive raw content consumption and/or user data and analyze the data locally to populate and/or update user knowledge base 416 and/or content analytics data 418, and/or analytics module/interface 420 may receive a report or representation of analysis performed centrally and sent to analytics module/interface 420 via communications interface 422.

In various embodiments, content optimization module 406 uses one or more of the request 404 and associated user information received from request handler 402; page metadata 412; publisher rules 414; user knowledge base 416; and content analytics 418 to determine optimized content to be included in a page to be served in response to the request 404. In some embodiments, a real time call may be made to an ad server, such as ad server 312, to determine an optimal, targeted ad to be included in the page. In some alternative embodiments, an ad may be selected based on page metadata 412, which may include continuously updated data concerning ads to be targeted to users based on user attributes, other user information, and/or page information. In some embodiments, an ad may be determined in real time via an auction or other competitive marketplace, conducted by the real time content optimization service 308 or a third party service. In various embodiments, the functionality of a full ad stack (multiple ad formats, all targeting capabilities, auctions, etc.) are embedded into a server side process, as disclosed herein.

In the example shown in FIG. 4, the content optimization module 406 passes to page assembly module 408 the page structure and identification and placement of publisher and ad content elements to be included in the page. Page assembly module 408 retrieves the publisher content from a publisher content store 424 and the ad content from ad content store 426, and uses the page structure and placement information to compose the page.

In various embodiments, publisher content may be retrieved in advance by publisher content retrieval module and/or process 428, via communication interface 430. For example, over time publisher pages may be crawled by publisher content retrieval module and/or process 428 and the pages parsed to extract individual content elements that may be subject to be reordered and/or replaced by sponsored (e.g., ad) content. Page metadata 412 may be updated to reflect the page structure and/or to include identification of the content elements and/or attributes of and/or other metadata relating to the individual content elements. In some embodiments, the latter metadata may be used to identify and select the content element(s) to be included in a given page in response to a request. In some embodiments, if not already presented publisher content for a page may be retrieved in real time, and the page processed in real time to extract the page structure and content elements.

In some embodiments, the caching of pages is done based on the publisher CMS cache settings. If the publisher configures its CMS to allow caching for certain amount of time (like age HTTP header), or allows caching and just verifying the page (e.g., using ETAG) the content optimization stores a local copy. “Prefetching” and “crawling” as described herein are used in various embodiments to understand what other content exists on the publisher's site, including content that might not be promoted currently on all pages, and is available to be presented to a specific user as part of the rules/recommendations from the content optimization module.

Likewise, in various embodiments ad content 426 may be obtained in advance and cached, in this example by ad content retrieval module and/or process 432 via communication interface 434, and/or may be obtained in real time. In some embodiments, ads are not cached, and are instead selected and retrieved dynamically per request, based on the user, geolocation, time considerations (e.g., pacing), etc.

In various embodiments, the optimized page composed by page assembly module 408 as described herein is provided to the request handler 402 and is returned to the requesting user as page 410. In various embodiments, the page 410 includes content optimized for the user as disclosed herein. The page 410 in various embodiments includes no JavaScript™ or other code required to be executed by the requesting client/browser to retrieve ads or publisher content. In some embodiments, limited JavaScript™ or other code may be included at the bottom of the page, e.g., to track and report content consumption metrics as described herein, to provide verification that an ad or other content has been displayed to the user, etc.

In some embodiments, ad verification code is not included on the page. Instead, real time content optimization service 308 and/or one or more other components of content optimizing edge server 306 execute the verification script on behalf of the requesting client and/or otherwise provides verification to the advertiser and/or ad service or network.

In various embodiments, providing a page with content optimized for the user and minimal scripts provides a page having content much more likely to be of interest to the user while providing a more desirable user experience since the page loads and displays fully much more quickly. In various embodiments, caching publisher and/or ad content on the edge server 306 and/or offloading to the much higher capacity processors and network connections of the edge server 306 tasks such as publisher content retrieval, ad identification, and ad content retrieval, enables an optimized page to be served and displayed much more quickly than under prior approaches, such as those illustrated in FIG. 1A.

In various embodiments, an optimized page provided to a first user as disclosed herein may be cached for a limited time, e.g., at an edge server such as edge server 306 of FIG. 3. In some embodiments, such a cached page may be served to a second user determined to be (sufficiently) similar to the first user, e.g., if a request for the same publisher page is received from the second user while the page remains available in the cache. For example, a vector or other representation of the user attributes of the first user may be stored and associated with the cached page. A subsequent request received from the second user may be processed in part by determining a corresponding vector for the second user. If the distance/difference between the vectors is less than a prescribed threshold, the cached page is served to the second user. If not, a page optimized for the second user is constructed and served, as disclosed herein.

In some embodiments, techniques disclosed herein are used to determine optimal content and/or placement across users. For example, “A/B” testing may be performed by serving versions A and B of a page to different and/or overlapping sets of similar users. User engagement for a content element of interest is measured, and the page and/or content version that performs best may be served to subsequent users who request the page.

FIG. 5A is a block diagram illustrating an embodiment of a system to create and maintain a user knowledge base. In the example shown, user knowledge base 416 is shown to include data reflecting user browsing history 502, user content consumption metrics 504, demographic information 506, and user profile/preference information 508. In various embodiments, user browsing history 502 is determined from one or more cookies placed previously on the user's browser. User content consumption metrics 504 in various embodiments is gathered by scripts and/or agents running on the user's client/browser and includes information reflecting user interaction with ads or other content elements, such as time displayed, amount of video viewed, etc. Demographic information 506 may include information known, learned, or inferred about the user, e.g., based on content consumption habits, purchases, etc. In some embodiments, a location is associated with the request, e.g., based on the source IP address. User profile/preference data 508 may in some embodiments include preferences explicitly indicated by the user (e.g., don't show me ads or content like this; or expressing interest in user-selected subjects, etc.) and/or user demographic or other information expressly indicated and/or learned or inferred about the user.

FIG. 5B is a block diagram illustrating an embodiment of a system to create and maintain a content analytics knowledge base. In this example, content analytics data 418 includes content consumption metrics 522 and ad content performance 524. In some embodiments, publisher content and ad content are treated similarly, as content elements available to be included in an optimized page. In some embodiments, content analytics data 418 includes normalized data enabling the performance and/or level of interest for a given user of content elements to be determined and compared, whether they comprise publisher content or ad content.

FIG. 6 is a flow chart illustrating an embodiment of a process to provide user-specific optimized content. In various embodiments, the process of FIG. 6 is performed wholly and/or in part by an edge server, such as edge server 306 of FIG. 3. In the example shown, a request for a publisher page is received (602). A page optimized for a user with whom the request is associated is generated dynamically (604) and served to the user (606).

FIG. 7 is a flow chart illustrating an embodiment of a process to generate dynamically a web page comprising user-specific optimized content. In various embodiments, the process of FIG. 7 is performed to implement step 604 of the process of FIG. 6. In the example shown, attributes of a user with whom a page request is associated are determined (702). A page structure of the requested page is determined (704). Publisher content appropriate for the page and optimized for the user is obtained (706). Ad content targeted for the user is determined and obtained (708). The page structure, publisher content, ad content, and content placement determined to be optimized for the user are used to compose a page optimized for the user (710).

FIG. 8 is a flow chart illustrating an embodiment of a process to determine user-specific content to be included in a dynamically generated optimized web page. In various embodiments, the process of FIG. 8 is used to implement step 706 of the process of FIG. 7. In the example shown, publisher pages are ingested (e.g., crawled) (802). The publisher pages are parsed to determine page structure and extract content elements (804). In some embodiments, the page structure information is stored in a database, such as page metadata store 412 of FIG. 4. Extracted content elements may be cached in a publisher content store, such as publisher content store 424 of FIG. 4. Content elements optimized for the user, based on the user's attributes, are selected to be included in a page to be served in response to the request (806).

Techniques are disclosed to generate and serve a page, dynamically and in real time, based at least in part on performance testing of two or more versions of the page as served to prior users.

In various embodiments, a system as disclosed herein continuously and/or periodically runs A/B testing on a small percentage of the traffic. In some embodiments, a control group of users receive a non-optimized version of the page that uses regular browser side injection to determine, retrieve, and render ads within the page. A second group (with similar size) is used as a comparison by pages generated using server side stitching as described herein. Based on historical data of this A/B testing the system in various embodiments can:

-   -   Calculate the performance improvement (e.g., in terms of ad         revenue generation; user experience, such as page load speed;         improved viewability of ads or other content; etc.) of the         solution for different sites, placements, devices, browsers etc.         (on real live traffic)     -   Based on the above, in various embodiments the system decides         what solution will provide better yield for the publisher and         can optimize accordingly.

In various embodiments, one or more of the following optimizations are implemented:

-   -   The system identifies that for users with no first party cookie         on the relevant site (user we see on a site for the first time),         it is better (or better in certain scenarios), yield wise, to         run the old/standard browser side ad injection (as it allows us         to leverage third party cookie that might already exist).     -   Similar decision are made for different browsers, placements         etc. that might yield better results for publisher (for example,         below the fold placements might be better of lazy injected on         the browser side to reduce load from the page and improve         latency)     -   The system can also optimize where and how to inject the various         tracking script (that are still required to collect information         about ad served) to improve yield

FIG. 9 is a flow chart illustrating an embodiment of a process to perform A/B testing of different versions of a page. In various embodiments, the process of FIG. 9 is performed by a server/service configured to compose and serve optimized content pages, such as real time content optimization service 308 of FIG. 3 and/or real time content optimization server 210 of FIG. 2. In the example shown, a page request is received (902) and a decision is made whether the request will be included in a test set (904). If so, a test version of the page is composed and served (906). Examples of a test version include, without limitation, an original version of the page the content of which has not been optimized, such as a version that executed scripts at the client side to determine and retrieve ads or other content; and a version of the page in which content items other than those that would have been included in a page optimized based on current content/user knowledge base information and/or in which content items are arranged in locations other than in a corresponding optimized version concurrently and/or contemporaneously being served in response to similar requests (e.g., same URL, similar user attributes, etc.). Metrics (e.g., content consumption, ad views, click through, other user engagement, etc.) are received and associated with a test (e.g., A/B or other version testing) with respect to which the process of FIG. 9 is being performed (908). If a received request is not selected (e.g., randomly) to be included in the test set (902, 904), a content page the content of which is optimized as disclosed herein based on the then-current state of the content and/or user knowledge base is generated and served (910) and associated metrics are received and processed (912). In some embodiments, metrics for at least a subset of users that received the normally optimized version of the page are selected and used to compare to corresponding statistics received at 908.

FIG. 10 is a flow chart illustrating an embodiment of a process to evaluate and use the results of A/B testing. In various embodiments, the process of FIG. 10 is performed by a server/service configured to compose and serve optimized content pages, such as real time content optimization service 308 of FIG. 3 and/or real time content optimization server 210 of FIG. 2. In some embodiments, the process 1000 is performed by an analytics process, module, or service, such as analytics module 408 of FIG. 4. In some embodiments, the process 1000 is performed to maintain, improve, and/or update one of both of a user knowledge base, such as user knowledge base 416 of FIG. 4 and a content knowledge base, such as content analytics 418 of FIG. 4.

In the example shown in FIG. 10, metrics associated with control group (e.g., an “A” user group that received a version “A” of a content page) and test group (e.g., a “B” user group that received a version “B” the same content page) are retrieved (1002). The relative performance of the “A” and “B” versions of the content page are evaluated (1004) and the result of the evaluation/comparison of the relative performance of the versions is used to update one or both of a user and a content knowledge database (1006).

For example, if the “A” version was not optimized by server side selection, arrangement, and/or retrieval and server side injection of ads or other content items as disclosed herein and the “B” version was optimized, or if the “A” version was optimized in a different way than the “B” version (e.g., different content items, ads, placement, etc.), in various embodiments the user and/or content knowledge base may be updated based on which version(s) performed better under which conditions and with respect to which subsets (e.g., based on demographics, etc.) of users.

While in the example shown in FIG. 10 the results associated with two versions of a content page (i.e., “A” and “B” versions) are compared, in various embodiments metrics associated with more than two versions of the content page may be retrieved and compared to update the user and/or content knowledge base.

In various embodiments, techniques disclosed herein are used to evaluate the performance of optimization techniques as disclosed herein relative to corresponding pages that have not been optimized as disclosed herein. In some embodiments, such comparison enables the performance “lift” provided by applying content page optimization techniques described herein to be measured.

FIG. 11 is a flow chart illustrating an embodiment of a process to perform testing relative to a control group that receives an un-optimized version of a content page. In various embodiments, the process of FIG. 10 is performed by a server/service configured to compose and serve optimized content pages, such as real time content optimization service 308 of FIG. 3 and/or real time content optimization server 210 of FIG. 2. In the example shown, upon receiving a page request (1102) a determination is made as to whether the request is selected to be part of a control group (1104). If so, a version of the content page that has not been optimized as disclosed herein is served (1106). If the request is not associated with the control group (1104), a version of the page that has been optimized as disclosed herein is generated and served (1108).

FIG. 12 is a flow chart illustrating an embodiment of a process to evaluate and use the results of testing relative to a control group. In the example shown, performance metrics associated with un-optimized and optimized versions of a content page are retrieved (1202). The results are compared to measure the performance “lift” provided by applying server side optimization techniques described herein and/or to identify any circumstances (subset of users, time of day, location, etc.) in which the un-optimized version performed as well or better (1204). A user and/or content knowledge base is updated based on the comparison (1206). Optimally, a report of results of content page optimizations is generated based on the comparison and provided (1208), e.g. via a report file that can be rendered by an associated application or view and/or via a page displayed via an administrative user interface.

FIG. 13 is a flow chart illustrating an embodiment of a process to determine dynamically whether to serve an optimized version of a page. In the example shown, a page request is received (1302). It is determined whether an optimized or un-optimized version of the page is to be served (1304). For example, it is determined based on the user, page, and/or other context data whether an optimized or un-optimized version of the page is to be served. If an optimized page is determined to be served (1304), an optimized version of the page is generated and served (1306); otherwise, an un-optimized (e.g., ads and other dynamic content retrieved at client by code executed by client/browser) version of the page is served (1308).

In various embodiments, techniques disclosed herein may be used to serve ads optimized for the user, even if the user is using ad blocking technologies to prevent or interfere with ad calls from the browser. In various embodiments, techniques disclosed herein may increase user interest in publisher and/or ad content provided to the user via a publisher's pages, increasing user interest and loyalty to the publisher and its pages, resulting in increased ad revenue and other benefits to the publisher. By enabling ad-related and other JavaScript™ to be omitted from pages as served to a user, and by offloading to much faster server processors and connections the task of determining and retrieving publisher and ad content, a faster page loading and browsing experience may be provided to users who navigate to a publisher's page(s). Use of testing and comparison techniques disclosed herein, in various embodiments, may enable further optimizations to be achieved and the benefits of optimization as described herein to be measured and evaluated.

Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive. 

What is claimed is:
 1. A system, comprising: a communication interface configured to: receive a request for a page; and obtain an original version of the page from a publisher in response to the request; and a processor coupled to the communication interface and configured to provide an optimized version of the page at least in part by determining dynamically an optimized content element for the page based at least in part on a user attribute associated with the request and content performance data generated based at least in part on testing of alternative versions of the page with respect to other users having the user attribute, wherein the optimized version of the page includes the optimized content element and removes a script included in the original version of the page, wherein the script included in the original version of the page is configured to retrieve the optimized content element, wherein the optimized content element is stitched into the optimized version of the page.
 2. The system of claim 1, wherein the system comprises a server.
 3. The system of claim 1, wherein the system comprises an edge server.
 4. The system of claim 1, wherein the optimized content element is determined based at least in part on a content consumption metric associated with a user.
 5. The system of claim 1, wherein the optimized content element is determined based at least in part on previous user engagement with the optimized content element.
 6. The system of claim 5, wherein the previous user engagement with the optimized content element includes user engagement associated with said testing of alternative versions of the page.
 7. The system of claim 1, wherein a placement of the optimized content element within the optimized version of the page is determined based at least in part on previous user engagement with the optimized content element.
 8. The system of claim 7, wherein the previous user engagement with the optimized content element includes user engagement associated with said testing of alternative versions of the page including one or more versions in which the optimized content element was rendered at a location associated with the placement.
 9. The system of claim 1, wherein the optimized content element is determined based at least in part on a content consumption metric associated with the optimized content element.
 10. The system of claim 9, wherein the content consumption metric is based at least in part on said testing of alternative versions of the page.
 11. The system of claim 1, wherein the user attribute comprises a demographic or other personal attribute of a user.
 12. The system of claim 1, wherein the processor is further configured to determine an optimal placement of the optimized content element within the optimized version of the page.
 13. The system of claim 12, wherein the optimal placement of the optimized content element within the optimized version of the page is determined based at least in part on content performance metric data generated in connection with said testing of alternative versions of the page.
 14. The system of claim 1, wherein a set of alternative versions of the page included in said testing of alternative versions of the page includes a version of the page that has not been optimized.
 15. The system of claim 14, wherein the processor is further configured to compare performance of the optimized version of the page with the version of the page that has not been optimized.
 16. A method, comprising: receiving via a communication interface a request for a page; obtaining via the communication interface an original version of the page from a publisher in response to the request; and using a processor to provide an optimized version of the page at least in part by determining dynamically an optimized content element for the page based at least in part on a user attribute associated with the request and content performance data generated based at least in part on testing of alternative versions of the page with respect to other users having the user attribute, wherein the optimized version of the page includes the optimized content element and removes a script included in the original version of the page, wherein the script included in the original version of the page is configured to retrieve the optimized content element, wherein the optimized content element is stitched into the optimized version of the page.
 17. The method of claim 16, wherein the optimized content element is determined based at least in part on a content consumption metric associated with a user.
 18. The method of claim 16, wherein the optimized content element is determined based at least in part on previous user engagement with the optimized content element.
 19. The method of claim 16, wherein a placement of the optimized content element within the optimized version of the page is determined based at least in part on previous user engagement with the optimized content element.
 20. A computer program product embodied in a non-transitory computer readable medium and comprising computer instructions for: receiving via a communication interface a request for a page; obtaining an original version of the page from a publisher in response to the request; and using a processor to provide an optimized version of the page at least in part by determining dynamically an optimized content element for the page based at least in part on a user attribute associated with the request and content performance data generated based at least in part on testing of alternative versions of the page with respect to other users having the user attribute, wherein the optimized version of the page includes the optimized content element and removes a script included in the original version of the page, wherein the script included in the original version of the page is configured to retrieve the optimized content element, wherein the optimized content element is stitched into the optimized version of the page. 